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Abstract 

This paper describes an R package implementing large sample tests and confidence 
intervals (based on the central limit theorem) for various parameters. The one and 
two sample mean and variance contexts are considered. The statistics for all the 
tests are expressed in the same form, which facilitates their presentation. In the 
variance parameter cases, the asymptotic robustness of the classical tests depends 
on the departure of the data distribution from normality measured in terms of the 
kurtosis of the distribution. 
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1 Introduction 

When you are interested in testing a variance parameter for a large sample in the non Gaussian 
framework, it is not easy to find a test implemented in the standard statistical software. In fact, 
we could not find one! The only available tool is the chi-square variance test tailor-made to 
the Gaussian context. This test is however commonly used by practicians even if the Gaussian 
assumption fails. We applied it to a data set of size n — 1000 with an empirical distribution very 
different from a normal distribution. The p- value of the chi-square variance test with alternative 
hypothesis Hi: a 2 < 1 is 4.79% which leads us to accept the alternative hypothesis at level 
a = 5%. Is it reasonable to use this test when we know that it cannot be u sed in the non 
Gaussian framework because of its sensitivity to departures from normality, e.g. iBoxl ( 1953 



From a mathematical point of view, one may wonder why no alternative test has been yet 
implemented since the asymptotic properties of the sample variance are well-known. Things are 
much easier in the sample mean study because t he one sample t-test is kno wn to be robust to 



departures from normality for large samples, e.g. lOzgur and Strasserl ((2004). This results from 
a direct application of the central limit theorem. The same remarks are valid when comparing 
the two-sample t-test (difference of means test) which is robust for large samples and the Fisher 
test (ratio of variances test) which is not. 

In the statistical framework, one may be a simple user, a tool developer, a theoretician or any 
combination of the above. There are natural interactions between these different communities 
and it is expected that their knowledge should be shared, a bove all for tasks that ha ve now 
become very basic. It is well-known for a theoretician, see e.g. ICasella and Berger (1990), that a 
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common method for constructing a large sample test statistic may be based on an estimator that 
has an asymptotic normal distribution. Suppose we wish to test a hypothesis about a parameter 
9, and 9 n is some estimator of 9 based on a sample of size n. If we can prove some form of the 
central limit theorem to show that asn^ +00, (9„ — &)/a-s — > jV(0, 1) where a~ is a convergent 

(in probability) estimate of Var{9 n ), then one has the basis for an approximate test. This scheme 
based on the central limit theorem will be called the CLT procedure. We have already specified 
that we did not find any alternative to the chi-square variance test for testing a variance when 
the normality assumption fails. On the contrary, the problem of the robustness (to departures 
from normality) of tests for comparing two (or more) variances ha s been widel y treat ed in the 
literature, see e.g. Ho2 (|l953l) . IConover et all (|l981l ). iTiku et all (| 19861) . IPanl (| 19991 ) and the 
references therein. Some alternative procedures to the Fisher test are implemented in R: the 
Bartlett test (bartlett .test), the Fligner test (f ligner .test), the Levene test (levene.test 
available in the lawstat package), etc. However to our best knowledge, for large samples, simple 
alternatives based on the CLT procedure have never been proposed or implemented. 



The main objective of this paper is to propose a unified framework, based on the CLT proce- 
dure, for large samples to test various parameters such as the mean, the variance, the difference 
or ratio of means or variances. This approach also allows direct derivation of asymptotic con- 
fidence intervals. Tests and confidence intervals are then implemented in our new R package, 
called asympTest. This modest contribution also solves the problem of finding a robust (to non- 
normality) alternative to the chi-square variance test for large samples. It also provides a very 
simple alternative to the Fisher test. However, note that the purpose of this paper is not to 
compare our tests to their competitors in terms of power. Finally, a first course of statistical in- 
ference usually presents mean tests in both Gaussian and asymptotical frameworks and variance 
tests restricted to the Gaussian case. The unified approach presented here is is very similar to 
the classical t-test from a mathematical point of view and gives us the opportunity to propose a 
more complete teaching framework with no additional difficulty. 

The paper is organized as follows. Section [2] deals with the mathematical concepts and 
describes our main notation. We also propose a mathematical explanation of the reason why the 
chi-square variance test and the Fisher test are not appropriate even when the sample size is very 
large. Finally, a general framework is also proposed that allows us to derive some asymptotic 
statistical tests for the mean, the variance and the difference (and ratio) of means or variances. 
In Section[3j the R package asympTest is presented. It notably includes the procedures described 
in the previous section. Finally, Sections [4j [5] and [6] are devoted to some discussions and the 
proofs of our results. 



2 Mathematical development 
2.1 Notation 

For one-sample tests, let us denote by Y = (Y\, . . . , Y n ) a sample of n independent and identically 
distributed random variables with mean [i and variance a 2 . These parameters are classically 
estimated by 



£00 = y = -£y< and ^(Y) = — -£(y-/i(Y)) 2 

i=i 
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In the two-sample context, let = fe 1 ', . . . , F^j and Y< 2 > = (y} 2 \ Y^j) denote two 

independent samples of and n' 2 ) random variables with respective means p' 1 -* and p*- 2 ' and 
variances cr 2 ^ and <y 2 2 y We also define the following parameters and their estimated versions by 
denoting Y = (YW, Y( 2 )): 

• Difference of (weighted) means: d p = pW - p x p< 2 > (p £ R) and 3^ (Y) = /xW (Y^) - 
pxpTv (Y( 2 )). 

• Difference of (weighted) variances: d a i = er 2 ^ — px er 2 2 ^ (p e R) and d CT 2 (Y) = a 2 ^ (Y^ 1 )) — 
^§(Y (2) ). 



Ratio of means: — and (Y 



)(Y(2>) ' 



• Ratio of variances: r n i = -Ip and tCB (Y) 

The known parameter p is, in our definition of and d a i , intrinsically nonnegative. But 
note that there is no mathematical problem to deal with negative values of p. 

In order to compare the Gaussian framework and the general one, we propose to denote by 
Y G (resp. Y^ 1 )' , Y ( - 2 - > ' G ) a vector (resp. two independent vectors) of n (resp. rS 1 ^ and n*- 2 -*) 
Gaussian random variables with mean p (resp. p^ and p*- 2 -*) and variance a 2 (resp. er 2 ^ and 

er 2 2 )). In the two-sample context, let us also denote Y G = (YW ,G , Y' 2 ^ ,G ). 

In the sequel, we will use the notation := to define some quantity. For some random variable 
Z and some distribution C, Z ~-> C (resp. C) means that Z follows (resp. approximately 

follows) the distribution C. 

2.2 About the Chi-square and Fisher tests 

In this section, we concentrate on parameters a 2 and r a i . The classical statistics of the chi-squarc 
test of variance and the Fisher test of ratio of variances are defined by 

A^(Y):=(n-l)MJ2 and A G f Y) := ^lil- K 



The notation expresses that the statistic is a measure of the departure of a 2 from a 2 in 

the Gaussian framework. When the data are Gaussian, it is well-known that 

^ a ^ G )^X 2 (n-l) and (Y G ) - T (n« - 1, - 

However, as shown in Fig. [l] both results become untrue (even approximately) under non- 
normality assumption. The theoretical reason may be explained as follows. Let = n and 
assume that there exists a > such that = an' 1 '. Assume also that Y (resp. Y^ and Y^) 
has a finite kurtosis k := E({Y - fx) 4 )/Var(Y) (resp. fcW and fc< 2 )). Then, as n —* +oo, 



A 3 _ B (Y)-(n-l) 



d 



7V(0,fc-l) (1) 



^(a^ 2 (y)-i) tf( 0lk n- 1 + !L—±y (2) 
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This result is a consequence of a more general one stated in Section 12.31 and proved in Section [6] 
Equations (JTJ) and ((2|) lead to the following approximations 



A a (Y)^^ _!,(*_!)(„_!)) and A^. r 2 (Y) Q ^AA ( 1, 



Jfe(l) _ 1 fc(2) _ 1 



X 2 (n- 1) ~Af(n- 1,2(ti- 1)) 



and 



When data are Gaussian ans when n is large, one obtains the well-known approximations 

2 ; 2 

since fc = /c (1) = fc (2) = 3 We can underline that A£L 2 (Y) (resp. ASL r a (Y)) and ASL 2 (Y g ) 

(resp. A2-j r (Y G )) differ in terms of asymptotical variances. More precisely, the gap between 
the two frameworks is essentially governed by the kurtosis. Indeed, asn^ +oo 

Var(A^JY)) k _ 1 



Var(K% ,(YG)) 



and 



Var(A^ r (Y)) fed) _ 1 + h^Ll 



Var(A£~~ (Y G )) 



2(1 + ^) 



Tab. [T] proposes the computations of these asymptotic ratios for different distributions. This 
allows the reader to assess the risk of using the classical statistics AS- and AS— under the 

non-normality assumption even when the size of the sample is large. 





Y £ £ y(2) ^ £ 


Test 


£ = X » 


£ = £(A) 


£ =i/([a,6]) 


Variance test 


i + 5 


4 


2 
5 


Ratio of variances test 


i + 5 


4 


2 
5 



Table 1: Ratio of asymptotic variances (non gaussian/gaussian) and — — . , N ° — in the 



case where k = = k^ and a = 1. 



2.3 Large sample tests based on the central limit theorem 

The parameters a 2 , and d a i can be viewed as particular means. Therefore, the idea (widely 
used in asymptotic theory) is to design asymptotic tests and confidence intervals thanks to the 
central limit theorem. The variables (Y) — r M and f^i (Y) — r a 2 can also be expressed in 
terms of means to which a central limit theorem can be applied. In order to unify asymptotic 
results, we define 9 as one of the parameters /x, a 2 , d^, d a 2, and r a 2. By applying a central 
limit theorem, the law of large numbers and Slutsky's theorem (see Section [6]), one obtains, as 
n — > +oo, 

A^(Y):=^^^AA(0,1), (3) 

where cr^(Y) is the standard error of 0(Y). The assumptions under which the central limit 
theorem can be applied and the definition of er~ (Y) are stated in Tab. [U R functions have been 
implemented to evaluate the standard errors of the estimates of 6, see Tab. |3l 
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Variance test statistic - Chi2(5) 



Ratio of variances test statistic - Chi2(5) 




400 500 600 

DeltaGVar 




I T 
0.6 0.8 



T 



1 .0 1.2 1 .4 
DeltaG RVar 



Variance test statistic - E(1) 




T 



T 



300 



400 



500 600 
DeltaGVar 



700 



n 

300 



Ratio of variances test statistic - E(1) 




r 



0.5 1.0 1.5 2.0 

DeltaG RVar 



Variance test statistic - U(0,5)) 



Ratio of variances test statistic - U(0,5)) 




to ^" 




T 



1 



450 500 550 

DeltaGVar 



i i i r 

0.8 0.9 1.0 1 

DeltaG RVar 



5 

Figure 1: Histograms of m — 10000 replications of test statistics of variance test (left) and 
ratio of variance tests (right) in the Gaussian context. The simulation has been done as follows: 

n = n« = n< 2 ) = 500, Y = = Y^ x 2 (5) (top), £(1) (middle) and U([0,5]) (bottom). 
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Assumptions 






E (Y?) < +X) 




p(Y) 

/ n 


a 2 


E (Y?) < +^ 


\ 


4(y) 

n 




E 






< +00, j = 1,2 




( YW ) , ...^( Y(2) ) 

n(i) P " n( 2 ) 




E 




< +00, i = 1,2 


\ 


n(i) P " n( 2 ) 




^ ^0,E 


« 




< +00, j = 1,2 


M (2)( Y ( 2 >) \ 


/^,( Y(1) ) , r (Y f -f 2 ,(Y< 2 ») 
/ „(D ' c' ) X „<*) 




°f 2 ) ¥= o, E 


« 




< +00, i = 1,2 


^2,( Y(2) ) V 


4^( Y(1> ) , r ^ (Y) 2 ^( Y(2 ») 



Table 2: Standard errors of estimates of 9. For the sake of simplicity we denote by Y := (Y — /i) 2 , 
Y := (Y - fi (Y)) 2 , 4 := Var (y) and 4 ( Y ) := ^ £? =1 ((* - ? 00) 2 - ^ (¥))*. 



6> 


Dataset(s) 


°$ (y) in R 


fj, 


y 


seMean(y) 


a 2 


y 


seVar (y) 




yi,y2 


seDMean (y 1 , y2 , rho=l) 




yi, y2 


seDVar (y 1 , y2 , rho=l) 


r n 


yi,y2 


seRMean(yl ,y2) 


r a 2 


yi,y2 


seRVar(yl,y2) 



Table 3: Standard errors of estimates of 9 in R. 
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Remark 1 The figures Fig. @ Fig. [3| and Fig. [^] allow the reader to illustrate the mathematical 
result (0). 

Remark 2 The asymptotic result (M allows us t o easily construct statistical hypothesis tests and 



confidence intervals, see e.g. Cas ella and Bergen u99(a ) p. 385 for development 



Remark 3 The alternative hypothesis Hi comparing the ratio of means or variances to some 
reference value ro may be expressed in terms of the comparison of the weighted differences d^ :— 
//p) — To/i^ or d a 2 := cr^ — r o&%) ^° ®> ^ e va ^ ue °f P being fixed to ro. 



3 Using asympTest 

The R package asympTest consists of a main function asymp . test and six auxiliary ones designed 
to compute standard errors of estimates of different parameters, see Tab. [3] The auxiliary func- 
tions should not be very useful for the user, except if he/she wants to compute himself/herself 
the confidence interval. The function asymp. test has been written in the same spirit as the 
R functions t.test or var.test. The arguments of asymp. test, its value and the result- 
ing outputs are inspired from the ones of t.test or var.test. In particular, the function 
as ympt.test returns an object of cla ss "htest" (which is the general class of test objects in R, 



see 



R Development Core Team! (|20Q4f )1 



The main arguments of the function asymp. test are: 

• x: vector of data values. 

• y: optional vector of data values. 

• parameter: parameter under testing, must be one of "mean", "var", "dMean", "dVar", 
"rMean", "rVar". 

• alternative: alternative hypothesis, must be one of "two. sided" (default), "greater" or 
"less". 

• reference: reference value of the parameter under the null hypothesis. 

• conf .level: confidence level of the interval (default is 0.95). The type-one error is then 
fixed to 1-conf . level. 

• rho: optional parameter (only used for parameters "dMean" and "dVar") for penalization 
(or enhancement) of the contribution of the second parameter. 

The user may only specify the first letters of the parameter or alternative. 

In order to illustrate this fu n ction , l et us cons i der th e iris data available in R. This famous 
(Fisher's or Anderson's iFisherl <| 19351) : lAndersonl ( 1935f )) data set gives the measurements (in 



centimeters) of the four variables sepal length and width, and petal length and width, for 50 
flowers from each species of iris: setosa, versicolor, and virginica. 

> data(iris) 

> attach(iris) 

> names(iris) 

[1] "Sepal. Length" "Sepal .Width" "Petal . Length" "Petal .Width" "Species" 

> levels (iris$Species) 

[1] "setosa" "versicolor" "virginica" 
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Figure 4: Histograms of m = 10000 replications of e ^-y) ^ or ^ = ct2 ' ^< j2 ' r t J - anc ^ r<:r2 - ^ nc 
simulation has been done as follows: n = = = 500, ~> W([0, 5]), F (2) ~> £(0.5). 





Sepal. Length 


Sepal. Width 


Petal. Length 


Petal. Width 


setosa 


0.4595 


0.2715 


0.0548 


0.0000 


versicolor 


0.4647 


0.3380 


0.1585 


0.0273 


virginica 


0.2583 


0.1809 


0.1098 


0.0870 



The following table presents the p-values of all the Shapiro- Wilk normality tests for the 
different variables and the three species. 

Let us concentrate on the variable Petal. Width for which the Gaussian assumption seems to 
be wrong for each one of the three species. The empirical means and variances are given below: 

> by(Petal. Width, Species, function(e) c (mean=mean(e) , var=var (e) ) ) 
Species: setosa 

mean var 
0.24600000 0.01110612 



Species: versicolor 

mean var 
1.32600000 0.03910612 



Species: virginica 

mean var 
2.02600000 0.07543265 

Is the mean petal width of setosa species less than 0.5 ? 

> require (asympTest) 

> asymp.test(Petal.Width[Species=="setosa"] ,par="mean" ,alt="l" ,ref=0.5) 

One-sample asymptotic mean test 

data: Petal. Width [Species == "setosa"] 
statistic = -17.0427, p-value < 2.2e-16 
alternative hypothesis: true mean is less than 0.5 
95 percent confidence interval: 

-Inf 0.2705145 
sample estimates: 

mean 
0.246 

Is the mean petal width of virginica species larger than the versicolor one ? 

> asymp. test (Petal. Width [Species=="virginica"] , 

+ Petal . Width [Species=="versicolor "] , "dMean" , "g" , 0) 

Two-sample asymptotic difference of means test 

data: Petal. Width [Species == "virginica"] and Petal. Width [Species == "versicolor"] 
statistic = 14.6254, p-value < 2.2e-16 

alternative hypothesis: true difference of means is greater than 
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95 percent confidence interval: 

0.621274 Inf 
sample estimates: 
difference of means 
0.7 

Is the mean petal width of virginica species 4 times larger than the setosa one ? 

> asymp. test (Petal. Width [Species=="virginica"] , 
+ Petal .Width[Species=="setosa"] , "rMean" , "g" ,4) 

Two-sample asymptotic ratio of means test 

data: Petal. Width [Species == "virginica"] and Petal. Width [Species == "setosa"] 
statistic = 8.0936, p-value = 3.331e-16 

alternative hypothesis: true ratio of means is greater than 4 
95 percent confidence interval: 

7.374946 Inf 
sample estimates: 
ratio of means 
8.235772 

This may also be done via a difference of weighted means test. 

> asymp. test (Petal. Width [Species=="virginica"] , 

+ Petal . Width [Species=="setosa"] , "dMean" , "g" , , rho=4) 

Two-sample asymptotic difference of (weighted) means test 

data: Petal. Width [Species == "virginica"] and Petal. Width [Species == "setosa"] 
statistic = 14.6447, p-value < 2.2e-16 

alternative hypothesis: true difference of (weighted) means is greater than 
95 percent confidence interval: 

0.9249653 Inf 
sample estimates: 
difference of (weighted) means 

1.042 

4 Type I error risks 

4.1 Comparison between classical and asymptotic variance tests 

In the context of large samples, two simulation studies are proposed in order to show the lack 
of reliability of the classical tests for variance parameters compared with the asymptotic tests 
studied in this paper. For each of the following examples, 10000 simulations of samples of size 
n = 1000 have been performed. 

1. Let us consider testing H : <r 2 — 1 versus Hi : er 2 < 1 with data sampled from distribution 
£(1) (i.e., under H ). 

In the case where a = 5%, the probability of accepting the alternative hypothesis is 8.91% 
for the asymptotic test and 20.99% for the chi-square test. 
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false 


true 


false 


0.7901 


0.0000 


true 


0.1208 


0.0891 



Table 4: Acceptance of Hi for the chi-square test (rows) versus the asymptotic test (columns) 



2. Let us consider test H : er?^ = criL versus Hi : cr?^ ^ o%) with data sampled from 
distribution U([0, 5]) for both samples (i.e., under H ). 





false 


true 


false 


0.9498 


0.0481 


true 


0.0000 


0.0021 



Table 5: Acceptance of Hi for the Fisher test (rows) versus the asymptotic test (columns) 



In the case where a = 5%, the probability of accepting the alternative hypothesis is 5.02% 
for the asymptotic test and 0.21% for the Fisher test. 

In both cases, the probabilities for Type I errors are worse for the classical tests than for the 
corresponding well-suited asymptotic tests. This is a direct consequence of the previous results 
summarized by figure Fig. [TJ 

4.2 Back to the example of the introduction 

Now, one may wonder what the consequences of these previous results are for pratical purposes. 
Let us consider again the example presented in the introduction. In order to illustrate the two- 
samples case, we propose a second example. In the following R outputs, the data of the first 
(resp. second) example are denoted by y (resp. yl and y2). These samples have a size n=1000 
and their empirical distributions, proposed below, do not seem to fit normal distributions. 








2 4 6 8 012345 012345 

y yl y2 

The following output provides the p-value of the chi-square test for the first example. 

1 > pchisq((length(y)-l)*var(y)/l,lengtri(y)-l) 

2 [1] 0.04785152 

Due to the apparent non normality of the data, one may prefer to apply the corresponding 
asymptotic test: 
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> asymp.test(y,par="var" ,alt="l" ,ref=l) 

One-sample asymptotic variance test 

data: y 

statistic = -0.9771, p-value = 0.1643 

alternative hypothesis: true variance is less than 1 

95 percent confidence interval: 

-Inf 1.070430 
sample estimates: 

variance 
0.8969455 

The two decisions do not match, but since the empirical variance is 0.8969455, one may think 
that tr 2 is slightly inferior to 1. In which case, we have to be cautious because our sample might 
be of the same kind as the 12.08% of Table H 

The following output provides the p-value of the Fisher test for the second example. 

> var .test(yl,y2) 

F test to compare two variances 
data: yl and y2 

F = 0.8874, num df = 499, denom df = 499, p-value = 0.1825 
alternative hypothesis: true ratio of variances is not equal to 1 
95 percent confidence interval: 

0.7444324 1.0578390 
sample estimates: 
ratio of variances 
0.8874061 

Due to the apparent non normality of the data, one may prefer to apply the corresponding 
asymptotic test: 

> asymp.test(yl,y2, "dVar") 

Two-sample asymptotic difference of variances test 
data: yl and y2 

statistic = -2.0925, p-value = 0.03639 

alternative hypothesis: true difference of variances is not equal to 
95 percent confidence interval: 

-0.49988046 -0.01635305 
sample estimates: 
difference of variances 
-0.2581168 

The two decisions do not match, but since the empirical variances are 2.034341 and 2.292458, 
one may think that cr?^ and <7 2 2 -j are slightly different. In which case, we have to be cautious 
because our sample might be of the same kind as the 4.81% of Table [5] 
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5 Discussion 



We have presented an R package implementing large sample tests for various parameters. The 
interesting point is that each test statistic can be written in the same form, as follows: 

A~ (Y) •= ° {Y) - 9 

This form clearly expresses the departure of the estimate from the true parameter, normalized by 
some quantity measuring the precision of the estimate. This approach is then attractive and easy 
to present. In the Gaussian framework, the one and two-sample t-tests follow this idea whereas 
the chi-square variance test and the Fisher test do not. One may wonder if it is possible to embed 
the classical Gaussian framework within this formalism. More precisely, for an estimate 9 (Y) of 



some parameter 9 with standard deviation erg- = yVar(9 (Y)), let us propose the test statistic 

W-^-?^ (4, 
"e "e 

in the case where a-g or possibly some known asymptotic equivalent a-g of a-s only depends on 9, 
or 

A~ (Y) - ^ (Y) - 9 (5) 

otherwise, where a~ (Y) is a consistent estimate of a-g. 

In the large sample framework, the parameters a 2 , cL, r„, d a i and r a i fall into the case of 
equation ^ and the corresponding statistic approximately follows aJV(0, 1) distribution. In this 
same context, the proportion parameter falls into the case of equation Q with ap = \J p(l — p)/n. 

In the Gaussian framework, the parameters /z and leading to the one-sample and two- 
sample t-tests also fall into the case of equation (J5|) . Let us now concentrate on parameters a 2 
and r a 2 (corresponding to the chi-square variance test and the Fisher test). It is known that the 
variance of a 2 (Y G ) and an asymptotic equivalent of the variance of fji (Y G ) are of the form 

2 i (. n-z. 2 \ _ 2 i ^MgV i 2 a! 2) - (/4 2) ) 2 

where fik = e((Y — E(Y)) k ^j is the k— th centered moment of Y (with our notation \i% = a 2 ). 
For Gaussian variables, fi4 — 3(/i2) 2 which leads to 

2 ~"2 r\ 2 i 

cr— = -ct and oy-j = 2r ty2 —ttt + -™ 

Now, in order to build a test, let us give the distributions of AS. (Y G ) and AS— (Y G ) 
expressed in terms of AS- (Y G ) and AS- (Y G ): 

a g fY G,_ -„li ^ 2 -^ 2 „ X 2 (ri-l)-(n-l) 

„2 „2 I 1 I ■ _ 



and 

AS- (Y G ) 



o-ja aV2/(n-l) ^2(n-l) 

(Y G ) - r a , r^A^ (Y G ) - r ff2 JF(n (i) _ 1? „(2) 



r (T 2 V /2/n( 1 ) + 2/n( 2 ) vW+W^ 
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One may propose a name for the previous two free distributions: centered reduced chi-square 
distribution and centered reduced Fisher distribution respectively denoted by Xcr(') an d Fcr('i ')• 
If these distributions were implemented such that one may evaluate quantiles and p- values, one 
could build two new tests directly based on A— (Y g ) and AS-- (Y G ). Of course, these two 

new tests would be strictly equivalent to the classical chi-square variance test and Fisher test 
and would then fall into the same formalism. 

Let us now comment on the concept of robustness in the Gaussian framework. In the par- 
ticular case of mean hypothesis testing, this robustness is expressed by the fact that A~ ^(Y G ) 
is equal to Ap. /J (Y G ) with the same asymptotic distribution A/"(0, 1). When considering the two 
new cases 9 = a 2 or 9 = r CT 2, A~ e (Y G ) is no longer equal to Ag- e (Y G ) but have at least the 

same asymptotic distribution jV(0, 1). However, one can prove that A~ e (Y G ) and A^ e (Y G ) 
are asymptotically equivalent in probability, which may be viewed as some kind of robustness. 

6 Proofs 

In this section, we only prove (|3|). The results |T]) and ([2]) are direct consequences. Recall that 
for each parameter, some assumptions are needed essentially in order to apply the central limit 
theorem. They are summarized in Tab. [2] 
Parameter [i: 

This is a direct application of the central limit theorem. 
Parameter a 2 : 
By definition 

-. n 1 n 



From the CLT, the law of large numbers and Slutsky's Theorem (see e.g. iFergusonl (|1996l )). it 
comes that as n — ► +oo 

V^(F- M ) 2 -^o. 

Therefore, as n — > +oo, 

(o* (Y) - a 2 ) 7V(0, Var((Y - M ) 2 )). 



Since := \J Var ^ ^ can be consistently estimated by (Y) := \j , see Tab. OH we 
obtain as n — > +oo 

Parameter d M : 

Recall that n — n^- 1 ' and — an' 1 '. As n — > +oo, 
v^(?(Y)-d„) = ^(^(Yl")- f .< 1 >)-px^(^(Y«))- fl < 2 )) 
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Since 



can be consistently estimated by 



we obtain as n — > +00 

Parameter d a i: 

As n — > +00, 

V^(3^(Y)-^ 2 ) 



4) ( Y(1) ) , , 4) ( Y(2) ) 



(i) 



+ /9 2 X 



l(2) 



^P(y (1) )-^ 



(i) -PX 



(4) ( Y(2) ) 



'(2) 



2 ., y(2) 



" [°Kv + p x n 



Since 



y(g) 



can be consistently estimated by 



y {1) V / + p2 x y (2) V / 



not 



i 



yw \ J y(2) V 
nW n( 2 ) 



we obtain as n — > +00 



Parameter r M : 



d ff 2 (Y) - d g 2 
^(Y) 



AA(0,1). 



Using Slutsky's Theorem, one may assert that asm +00 

'^(YW) m «\ 



Vn (fjj, (Y) — r M ) 



n— *+oo 



(2) 



M (2) ( Y (2)) M 



v /X( 2 ) (Y( 2 )) 

(Y^) - m w 



^(2) ( Y ( 2 )) M 



(2) 



(i) 



/,(2) 



fa (/x( 2 )) ; 



(V 2 ) - M (2) (y< 2 > 



AT 0, 



+ 



(/|(2)) 2 V(M (2) )V " 
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Since, er— := 





( A 


( M (2))2+ 





can be consistently estimated by 



^(Y) := \ 



1 >M Y( % r ~ (Y) .*M Y(2) ) 



M ( 2 ) (Y( 2 )) 



we obtain as n — > +00, 



Parameter r„ 



(Y) — r M d 



JV(0,1). 



The proof follows the ideas developed for the parameters c 2 , and iv, and thus is left to 
the reader. 
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